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Title: In Vivo Recombination 

Field of the invention 

The present invention relates to a method for in vivo 
5 recombination of homologous DNA sequences. The method is a 

forced artificial evolution resulting in a DNA sequence encoding 
a polypeptide having an advantageous property. 

Background of the invention 
xq Homologous recombination between DNA sequences placed on 

the same plasmid is well known to the skilled artisan (Weber and 
Weissmann, Nucl. Acid. Res. (1983) -11, 5661-5669). EP 252666 Bl 
(Novo Nordisk A/S) , W095/22625 Al (Affymax Technologies N.V.) 
and WO9101087 discloses in vivo recombination of genes placed on 
15' the same plasmid. 

E? 449923 Bl (Setratech) discloses a process of inter- 
■ genie recombination in vivo of partially homologous DNA se- 
quences. The recombination takes place' in cells of which the en- 
zymatic mismatch repair system is. defective. 
20 J. Biotechnol. (1991) 19, 221-240 discloses a process 

for inactivating a gene in. the genome of B. Amyloliquefaciens. 
The process involves integration and excision of a pE194 mutant. 

In Chemical Abstracts 118:161925 (1993)' the minimal se- ' 
quence length of homology necessary for recombination . is deter- 
25 mined. The method of determination involves integration of a ■ 

pE194 derivative carrying a _gluconase gene. 

In J. Bacteriol. (1982) 152, 524-526 a plasmid pBD9 , 
which, comprises the two plasmids from' Staphylococcus aureus, 
■pE194 and pUBllO, is disclosed. Other plasmids based on pE194 .or 
30* pUBHO are disclosed in Molecular and General Genetics (1988), 
213 , 465-70,' Molecular and General. Genetics (1984), 195, 374-7,. 
Plasmid (1981), 6, 67-77 ■ and Genetika , (1986) 22, 2750-7. 

In Yichuan Xuebao .(1993)/ 20, 272-8 (see Chemical Ab- 
stracts 120:1670 (1994) ),, Chen et . Al. disclose the plasmid 
35 pNWl02, a pE194 derivative, in which the thermostable a-amylase 
gene from Bacillus licheniformis is inserted. pNW102 was trans- 
formed into the Bacillus subtilis strain BF 7658, followed by 
incubation at non-permissive temperature, . allowing homologous 
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recombination between the a-amylase . genes of pNW102 and. BF 7658. 
The a-amylase produced of 'the recombinant strains shows the same 
characteristics as the a-amylase from Bacillus* licheniformis . 

However, in the related art there is no indication of 
5 that DNA. sequences can.be recombined 'in vivo by repeating inte- 
gration and cross-out in order to. obtain a gene encoding a ' 
polypeptide with improved , characteristics , .such as increased 
. stability, improved specificity or higher activity., 

.Accordingly, .a problem of the invention. is 'to provide an 
b improved in vivo process of . generating new DNA sequences. 

Brief Description of the invention 

The present invention is directed to a "in vivo process 
of generating new DNA sequences . . It has turned -out that new .'DNA 
s sequences, with improved properties can be generated in a -fast: 
.and efficient 'way . from at least two homologous DNA sequences .by 
a process., involving in vivo •• exchange of DNA between at. least two 
* 'vectors, each containing "a homologous DNA sequence and ah origin 

of replication. w . ' t .. / ' * * .. .. ' 

o '.The, process -of "the invention' is characterized by; that? : 

sdid exchange, o : f 'said homologous sequences' is done by repeating : 
' in vivo at least, once,;, said integration' and excision . of one' vec- • 
'tor into the other". . ; V * ■ ' " K . '* . < ' * 

Accordingly, in a first aspect the invention relates to 
5 a process f or in ' vivo recombinat ion- of homologous DNA sequences', 
comprising the following steps - :. 

(a) % , incubating., a cell '.containing at least two DNA' 1 struc- 

';; tures , each 'comprising a DNA sequence and 'an? origin- of 

/ ■ ' • " replication, under conditions which. favour Integration 
0 ■ .. of one of the DNA-structures into one of the others, 

thereby forming a hybrid DNA structure; ■ " 

(b) ' incubating the cell under conditions which favour cross- 

ing out from said hybrid DNA structure, thereby forming 
' novel DNA structures, each comprising a recombined DNA 
is sequence and an origin of replication; 

(c) repeating steps a)-b) at least once. 
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One of the advantages of the in vivo recombination 
process of the invention is by repeating the' exchange in vivo . 
between the DNA sequences it is possible to obtain numerous di 
5 ferent recombination patterns between said DNA sequences. 

This is illustrated in figure 1 herein, which. shows the 
result of in vivo recombination between the two DNA sequences 
Savinase and Savisyn performed as described in example 1, 

In figure 1 Savinase/Savisyn recombined clones such as 
10 clone no. 3, 8 12 are derived from more than one in vivo recom- 
bination event between the two DNA sequences. 

Brief description of the drawing 

Figure 1 illustrates' the recombination of two DNA se- 
15 quences,- placed on the plasmids pTR, temperature resistant sine 
origin of replication is functional at 50°C, and pTS, tempera- 
ture sensitive since origin of replication not functional over 
45°C. Under non-permissive conditions (i.e. at 50°C) , pTS inte- 
grates into pTR, forming a hybrid piasmid. 
io Figure .2 illustrates the cross out. from the hybrid plas 

mid, - forming two new DNA. sequences , each t placed on a piasmid. 
Repeated crossing- in and out may be- done by e.g. temperature cy- 
cling or simply by keeping temperature pressure and selecting 
for resistance of pTS (i.e. Ern R ) . ' ,* 

;s Figure 3 illustrates the plasnids which are obtained af 

J - ter - repeated integration -and -crossing out events'. ~ : \ . - 

Further, figures 1 and. 2 illustrate a DNA structure sys 
tern suitable for in vivo measurement of hybrid structure forma- 
tion, and suitable for performing a forced cross out of the DNA 
30 hybrid structure. . 

Fig. 1: DNA structure pTS : 

Suitable active promoter directing ex- 
pression of active protein 
Erm R : antibiotic resistance gene 
35 DNA structure pTR comprises: ; 

Not any active promoter to direct tran- 
scription of GFP 
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GFP: Green Fluorescent Protein 
The Erm R antibiotic resistance gene in pTS ..is active due 

Ito the active promoter. \ 7 / . ,. . ' ; 

The GFP gene in pTR is inactive (i.e. the gene is non-. 
5 transcribed); due to .there ' is- not ah active promoter tadriye, ex- 
pression, of the gene. ' • , 

Fig. 2: After formation of the hybrid; structure accord- 

■ ing \to the invention GFP is . now active (driven by the. promoter) . 
This makes it possible to in; vivo measure the hybrid structure ; 

o formation (see below for further details) . ' . 

' >. . 1 i n contrary the Erm R antibiotic resistance gene is now 

inactive (i . e . the gene \ is: non-transcribed) due to there is. hot : 

an active promoter to drive expression of the gene . .'Forced . cross . 

out 'of the DNA hybrid structure is performed by incubating'the 
5 - cells - under ,Erm R antibiotic pressure/ thereby forcing a cross' 

out of the DNA hybrid, structure./, since the cell is. only able to 
' express the Erm R antibiotic'' resistance gene after • the crossing; , ' 

out of 'the DNA structure has taken. place 1 . 
; ' • " Figures 4 and 5: Illustrate the two vectors ..p5X120 v(f ig. 

o 4 ) • an d pMB4 3 0 (fig.'. 5) used as described in example 1. to ' in vivo 

recombine the two "DNA sequences '' Savinase and Savisyn. ,\ v V- [ . • 
• ' pSX12.0 comprises ij an open reading frame 'encoding- Sav- 

inase, ' ii) an active'- promoter directing transcription of Sav- ' 
•"••inase,. and iii)' a gene conferring, .resistance .t^o chloramphenicol 

: . pMB4 : 30 comprises i) a, Savizyn DNA* fragment, ii) a termi- 

nator (resulting in no transcription through the regions follow- 
- • ing this)', iii) a gene conferring resistance to .erythromycin . - ; 

(Ern R ) , and iv) an- open reading- frame- encoding ' Green Fluorescent 
0 Protein (GFP) . *' ' ,\ ; i: r-;;-* \^'^;;7 ; - ; * " ■> ^^-■^:^ r \*? t :- — ; 
Fig. 6:. Illustrates the results of Savinase/Savisyn 'in' 

■ vivo recombination, performed as described in example 1 . - 

The schematic representation shows 12 of the shuf- 
f led/recombined clones, 
is " Horizontally are given the positions of the 

restriction sites unique for Savizyn. 

The- letter "S" denotes that the sequence at this position 
is identical to the Savinase sequence and the letter "2" denotes 
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that the sequence stems^ from Savizyn.. All of the sequenced 
clones are different in their pattern of recombination. 

Definitions 

5 As used herein, the following terms have the following 

meanings: 

The term "DNA sequence 11 includes any DNA sequence which 
is desirable to modify according to the invention e.gr. a DNA se- 
quence encoding, a polypeptide, ' for example an enzyme, pharma- 

10 ceutically active polypeptides, e.g. insulin growth hormone hu- 
man hormones or growth regulators, and sequences such as promot- 
ers, transcription or translation .regulators and other intracel- 
' lular polynucleotides and polypeptides'/ 

The term "homologous" means the' DNA sequences have some 

15 . stretches of identical nucleotides, allowing recombination in 
vivo. Said stretches within the DNA sequences are at least 15 ^ 
base pair long, more preferably at least 25 base pair long, even 
more preferably at least 50 base, long, and most preferably at 
least 150 base pair or more. 

20 The term "homologous" comprises stretches of DNA 

sequences that are from 60% identical to DNA sequences that 
differs in only one nucleotide. It is preferred that the' 
stretches of DNA sequences are more than 70% identical, more 
preferably at least 80%, especially more than 90% identical, an'd 

25 more preferred that the stretches of DNA sequences are. 95% 
identical, most preferred at least 97%. 

Within the stretches of the DNA sequences of interest, 
the DNA sequence homology referred to above is determined as the 
degree of identity between two sequences indicating a derivation 

30 of the first sequence from the second. The homology may suitably 
be determined by means of computer programs known in the art, 
such as GAP provided in the GCG program package . (Program Manual 
for the Wisconsin Package, Version 8, August 1994, Genetics 
Computer Group, 57 5 Science Drive, Madison, Wisconsin, USA 

35 53711) (Needleman, S.B. and Wunsch, CD., (1970) Journal of Mo- 
lecular Biology, 48, 443-453). Using GAP with the following set- 
tings for DNA sequence comparison: GAP creation penalty of 5.0 
and GAP extension penalty of 0.3, the homologous DNA sequences 
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of interest exhibits a degree ;; of identity preferably ..as de- .: 

scribed above. 

The term "Recombine.d DNA sequence" means the DNA se- 
quence. which is a result of a recombination event between at.; 
■5 least two, DNA sequences.. The recombination event may be a single 

recombination event or. may be first, a recombination event be- 
, tween at least two DNA sequences followed by; a cross out at a 
'different site than the first 'recombination event. 

The term "DNA structure" means a DNA molecule, e.g. a 
10. vector, a plasmid,, a chroaosorce (e.g. .a Bacillus subtilis chro- 
■* mosome)," a virus , a phage , a " t r ansposoh or a' genome-.;' v - 

The term "vector" means any: linear or circular DNA mo- /■■ 
lecule that is able to contain a. DNA sequence to be recombined ; 
and an origin of replicat ion ."' The term includes any vector known 
15 to the skilled artisan e. g . - a- •pla'smid. or^ a phage .' The DNA se- .,; 
' guence is optionally-but hot necessarily- under' control, of a ' 

. promoter . 

The term "a cell" used herein in connection .'with a in 
vivo recombination process of the invention' is intended to cover.' 
2d both "a cell" and "a cell population".;' / . 

The term "transferring" means 'that the DNA structure is 
. introduced into a cell.. The DNA structure 'may be introduced using , 
/ techniques known in the art for direct ' introduction of DNA, e:g. [ 
■ / by' use of/electrdporation, transforrr.at.ion of competent cells, pro- 
• 2E> toplast transformation, transf ection, transduction,, conjugation or 
ballistic transf ormatioh-. _ ; _ ^ ' - • '• 

..The tern "incubating"' (or cultivating, growing), means- 
• that the cell, is treated in such a way that the normal cell •. 
''. .,''•' functions -are .active,' especially' the functions necessary for re-: 
'- 30 combination. The- cell' can be- incubated on a/ solid, substrate , ... 
' (agar petri disc) or the. cell can be incubated under submerse - 
' • conditions.. • It is preferred that the cell is incubated in a test 
tube, e.g. an eppendorph test tube. .In the case of a tempera-' 
ture-sensitive origin of replication, the test tube is conven-. 
35 iently placed in. an automatic terr.ocycler . Further the cell may 
be' incubated in standard shake- flaks . 

The term "origin of replication" (ori) means a region 
where replication of a DNA structure is initiated. The term 
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functional in connection with origin of replication means that 
replication* can be initiated from the origin. The term non- 
functional means that no replication is initiated under the 
given conditions, or that the initiation of replication is redu 
s 'ced, e.g. to a level insufficient for survival of a cell con- 
. taining a DNA structure with a marker gene and the ori, under 
appropriate selection pressure. 

The term "temperature sensitive" in connection with ori- 
gin of replication or vector means the vector is unable to rep- 
10 licate at e.gr. increased temperatures, i.e. non-permissive con- 
ditions, which' yet permit growth of the parent cell. 

The term "temperature resistant" in connection with 
■ origin of replication or vector means the vector is able to 

replicate at increased temperatures. 
15 The 'term "DNA library" is a library comprising numerous 

different DNA sequences. A DNA library may be made according to. ; 
standard procedures known in the art, such as e.g. by in vitro 
DNA shuf f ling/DNA -recombination (WO 95/17413, WO 95/22625) 
and/or Error-prone PCR and cassette mutagenesis. 
20 A DNA library herein may co-prise as low as 2-20.' 

different sequences, as e.g. a library where only one amino acid 
is varied. Further a DNA library may comprise numerous different 
'DNA sequences, e'.gr. up to, 10 i0 different sequences. 

2 5 Detailed Description' of the Invention 

The present invention comprises a process for in. vivo 
recombination of homologous DNA sequences, comprising the steps 

(a) incubating- a cell containing at least two DNA struc- 
tures, each comprising^ a DNA sequence and ah origin of 

30 replication, under conditions where i) at least one ori- 

gin- of replication is non-functional and ii) the cell i: 
under -a selective pressure which only allows growth of 
the cell if it contains the DNA structure which is' hav- 
ing the non-functional origin (i.e. favouring an, inte- 

35 gration of one of the DNA-structures into one of the 

others, thereby forming a hybrid DNA structure); • 

(b) changing of said conditions to conditions, where fewer 
origins ,of replication are non-functional than in step 
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a) ( i . e . favouring a ..crossing ..out, from said hybrid DNA 
structure); 

(c) repeating steps. a) -b) at least once. ■ ■ 

5 The recombined DNA.seguence will encode new polypeptides 

having new properties. It is then possible to incorporate a se- 
lection or screening system in the process of the/invention in 
order to select and/or screen for desired properties encoded by 
of or more of the recombiried DNA sequence(s) . 
io The number of repetition (s) in step (c) above may be one 

1 repetition or it ..may be more such as 5-10 repetitions or 50-100 
or even higher numbers of repetitions. 

It is \pref erred- that integration is forced (or made fa- t 
vorable for survival of -the J)NA structure) by incubation of the 
15 cell under such conditions where the origin of ■ replication of a 
DNA structure, is non-functional/ and. that- cross' ,but ,'is. forced bv 
incubation of the cell under such- conditions where the origin*. of 
replication of- a DNA structure again is, functional . 

In another preferred embodiment the routine depicted 
20 above in the steps (a) to (c) it is preferred that the cells are 
., kept under constant" conditions (e.g. constant temperature and 
selection pressure) where spontaneous cross-in and cross-out 
happens. That is.- the shuffling procedure, in. the cells', is re- 
peated without changing the conditions repeatedly. 
25 in a number , of DNA structures, .such as in many Bacillus ■ 

DNA structures (vectors) {Bacillus subtilis and Other- Gram- 
Positive Bacteria , ..Spnensheim- et- al., 1993 , American Society for 
. ■ Microbiology, Washington D.C,.,) the origin -of .replication -com- 
■ prise a ori ( + ) and a ori (-')".• ori(-f) controls where, replication . 
30* of the first , DNA strand "starts, and ori (-) controls the ini- 
tiation point for replication on the second strand (Bacillus 
subtilis and Other Gram-Positive Bacteria, Sonensheim et al., 
1993 , American Society for Microbiology, Washington D.C.)-. 

In an embodiment of the process of the' invention, at 
35. least one DNA structure does not comprise ori(-) (i.e. only com- 
prise ori(+))- preferably the. DNA structure which does not com- 
prise ori(-) is a DNA structure which origin (i.e. herein 
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ori(+)) is functional during the hybrid DNA structure formation 
performed according to the invention . 

One advantage of this embodiment of the invention is 
when only ori(+) is functional in said DNA structure, this DNA 
5 structure is in a single-stranded state for relatively longer 
period, compared to a DNA structure comprising both ori(+) and 
ori (-) - This prolonged single-stranded state facilitate the hy- 
brid DNA structure formation according to a process of the in- .- 
vention, 

10 Preferably, the DNA structure which does, not comprise 

ori(-) as described above is a, Bacillus structure, more pre- 
ferred a Bacillus vector. 

In order to illustrate this concept, an example is a 
■ modified pTR (TR: temperature resistence) Bacillus vector which 

15 does not comprise ori(-) (see figure 1). If the hybrid formation 
is performed. at 50°C only the origin of pTR is functional 
(origin of pTS (TS: temperature sensitive) is not functional) 
(see fig. 1,2); The DMA structure pTR only comprise ori(-f) and'-' 
is therefore in a single-stranded state for relatively longer"' 

20 period, which facilitate the recombination event and conse- 
quently hybrid formation with the pTS DMA structure "(see fig. '** 
1,.2) .; 

It is more preferred that one DNA structure is a vector 
which is able to replicate under certain (permissive) conditions 

25 "and. unable to replicate under other (non-permissive) conditions. 

... The vector, may ,..:f or instance, be one which is temperature-- 

sensitive for replication. Thus, the vector may be one which is 
unable to replicate at increased temperatures , . which yet' permit 
growth of the parent cell. The ceil is initially incubated at a 

.30- temperature permitting vector .replication and subsequently after 
integration into the other. DNA structure may have taken place, 
incubated at a temperature which does not permit vector repli- 
cation so that the vector is lost from the cells unless inte- 
grated. '* . 

35 The vector may further comprise a selectable marker. In' 

this case, the incubation at the non-permissive temperature may 
be conducted under selective conditions to, ensure that only * 
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cells containing the integrated vector (which includes the DNA 
sequence and the selectable marker), will survive. 

Further, it is preferred to construct a system where it 
is possible in vivo to measure formation of the hybrid struc- 
ture. . • ■ 

Preferably, this, is done by constructing a system 
wherein an in vivo, screenable or selectable protein is- expressed 
only after the hybrid structure has been formed. ' 

Preferably, said screenable protein is a protein which 
is fluorescent under suitable exposure, and in particular said 

• fluorescent protein is Green Florescent protein (GFP) or vari- 

ants thereof . . 

Green 'Florescent protein (GFP) or variants thereof and hov 
the GFP 1 s are activated to fluoresce are described in the art 
and reference is made to (Crameri et al; Nature Biotechnology 
14:315-319 (1996); and Cormack et a 1 . GENE 173:33-38,(1996)) for 

further details. 

One of the advantages of using such a screenable protein'- .'■ 
which is' fluorescent under suitable exposure of light, and in 

> particular GFP or variants thereof, is that 'it is then possible 
by e.g. fluorescent-activated ceil sorting-(FACS) technology to 
selectively sort out .cells which comprise said hybrid structure . 
This nay eliminate background of ' cells , which do not comprise 
said hybrid structure after performing of , step, a) according -to 

5 the first aspect "of the., invention . / * .... • 

FACS is a widely known screening technology and reference - 
is made to- (Cormack et al. ' 'TACS--optimized mutants of the green 
fluorescent protein (GFP) M . GENE 173 : 33-38 ( 1996 ) ) for further 

• details..-' - 

0 • ... In order to illustrate a suitable system enabling, in , 

vivo, measurement of. the formation of the hybrid structure , an 
example of a suitable system is shown in figure 1 and 2 . 

■ Alternatively to the use of a screenable protein as dis- 
cussed above, it may be advantageously to use a selectable 

35 marker,- wherein said selectable marker is only transcriptionally 
active when the hybrid structure is formed. 
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Further, it is preferred to that the crossing out from 
the hybrid DNA structure, in 'step b) of the first aspect of the 
invention, is a forced crossing out event. 

Forced crossing out from the. hybrid structure may be 
5 performed by: 

. i) - constructing a system wherein the DNA hybrid' structure 
does not express a selectable marker, and at'.least one 
of the at least two DNA structures in step a) of first 
aspect is able to express said' selectable marker; 
io ■ ii) ; incubating the cell under selectable conditions, 

thereby forcing a cross out of the DNA hybrid struc- 
ture, since the cell is- only able to express the 
marker gene after the crossing out of the DNA /hybrid 
structure has taken place. 
15 Preferably, selectable marker "gene mentioned above is 

expressed on a DNA structure' where the origin of replication' is^ 
. non-functional during a forced integration of the -DNA structures 
into a DNA hybrid structure performed as described above, and I- 
preferably the selectable marker' gene is an antibiotic resis- i;t , 
20 tance'gene. See below for further' description of suitable se- 
lectable marker genes. , 

In order to illustrate the concept of ■ forced crossing 
• out from the' hybrid DNA structure,* an example of a suitable sys- 
tem is shown in figure2 1 and 2. * 
25 The selectable marker • may be any marker known in the art, 

' -for instance, a g^ene coding for a product which confers anti- 
biotic resistance to the; cell, which confers prototrophy to .an , 
auxotrophic strain, or which complements a defect of the host,, 
(e.g. dal' genes introduced in : a dal- strain; cf. B. Diderichsen 
30 (1986) , Bacillus :' Molecular Genetics and. Biotechnology Applica- 
tions', A . T-. Ganesan and .J. A.- Koch, Eds.., Academic Press , pp. 35- 
46.. The selectable marker may, e,;g; , • be' excised from a known 
source or' present on a vector, e.g. a plasmid, used for the con- 
struction of the DNA structure to be used in the method of the 
35 invention.. 

This selection may be accomplished by growing' the cells 
under selection pressure for the 'selection marker encoded by the 
marker- gene. V . " 
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The DNA structure (in which the vector is to be 
integrated) may contain a gene encoding any selectable marker, 
e.g. of any type as described above in connection with the 
marker optionally carried by the vector to be used in the method- 

s of the invention. Thus, the marker gene may encode an antibiotic 
resistance, such as resistance to kanamycin, tetracyclin, 
ampicillin, erythromycin, chloramphenicol, or a resistance to 
various heavy metals such as selenate, antimony or arsenate. 

. In a preferred embodiment, the cell used in the process 
is defective or transitorily inactivated in the enzymatic mis- 
match repair system or has a reduced level of mismatch repair. 
In the synthesis of DNA , errors can occur and the resulting non- 
complementary pairs of bases are called mismatches . A process 
for the correction of errors (mismatches) in DMA exists. In E . 

5 coli the errbrs are very rapidly and accurately detected by two 
enzymes (MutS and MutL). enabling a third enzyme (MutU) to unwind 
the two DNA strands- and a' fourth, enzyme. -(MutH) to cut the neo- 
synthesized strand on a sequence, of DNA (GATC) itself methylated 
later. For a closer explanation, reference is made to EP 449923 

0 (Setratech) , the contents of which hereby are 'incorporated by. 

reference. ' •• 

In another preferred embodiment the cell used in the 
. "process is impaired- ; or hampered in the major recombination en- 
zymes such as AddAB. It has been shown that a strain impaired in 
5 the AddAB genes' shows a higher frequency of homologous recombi- ■ 
nation. Both legitimate and illegitimate recombination frequency 
are increased. But especially the recombination frequency for 
illegitimate recombination is favoured when working with a ■ 
strain impaired or hampered in the AddAB ATP-depenqent nuclease 
(Meima, • R ;. Haigema, BJ; . Dijkstra, H; Haan, GJ; Venema, G; Bron, 
S (1997) Role of enzymes of homologous recombination in ille- 
gitimate -plasmid recombination 'in Bacillus-subtilis . JOURNAL OF 
BACTERIOLOGY Vol. 179 , No. 4 pp. 1219-1229.) 

■In another preferred , embodiment the cell used . in the 
35 process is over-expressing one of the key recombination enzymes, 
recA. The recA enzyme is involved in homologous recombination by 
promoting renaturation of ssDNA to form the heteroduplex molecules 
necessary for the recombination event and initiates the exchange 
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of DNA strands. (PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR 
BIOLOGY Vol. 56 , pp. 129-223 * (1997) ').. The recA over-expression 
can be achieved by either classical mutagenesis of the^ host strain 
or cloning of a highly expressed recA gene . 
5 In another preferred embodiment the cell used in the 

process is subjected to UV, X-ray or DNA damaging agents to 
increase recombination frequency between the two homologous genes 
to be shuffled. Stress like UV, X-ray or DNA damaging agents is 
known to induce the recombination enzymes via the SOS response arid 
10. activate enzymes involved in recombination processes like recA. 
(BACILLUS SUBTILIS AND OTHER GRAM- POSITIVE BACTERIA, pp. 529-537 
(1993) ) . 

In another preferred embodiment the pTR based plasmid used 
as a carrier of one of the genes to be shuffled is replaced by 
15 either a low-copy version of pTR or, another B. subtilis compatible 

low-cony plasmid like pWVOl, "(MOLECULAR AND GENERAL GENETICS Vol . 5 

■ v. 

249 , No. 1 pp. 43-50 (1995 ) ) , pTA1060, (JOURNAL OF BACTERIOLOGY^ 
■Vol. 142 , No. 1 pp. 315-8 (1930)) cr pAMbl , (JOURNAL OF * 
BACTERIOLOGY Vol. 157 , No. 2 pp.' 445-53 (1984)). ' 

20 In a further preferred embodiment the one of DNA construct 

comprising one the genes to be shuffled is situated on the 
chromosome -.of the cell and another DNA construct, comprising . 
another. gene of interest to be recombined is a plasmid. When 
performing an in vivo recombination process of the invention the 

25 plasmid will be recombined with the DNA construct on the 

chromosome. Said plasmid may e.g. be a temperature sensitive 
plasmid. 

The advantages of 'using a low-copy pTR, or a .system where 
one of the DNA constructs is situated on the chromosome as 

30 described ■ above, is that the hybrid DNA construct after 

- recombination of the two homologous genes will be the predominant 
form of the genes in the cell. A high-copy- background of.'non- 
r ecombined pTR plasmids will inevitably reverse the gene hybrids 
formed in the first round back to wildtype after the- second round 

35 of recombination and therefore prohibit any shuffling of the two . 
genes. When using the chromosome as a carrier of one of the genes 
no wildtype background can coexist when the pTS has integrated via 
the homologous genes. 
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It is preferred that the cell used in the process is a, cell 
suitable for large scale production of the product of the 
recombined DNA sequence. It is preferred that the cell is a 
bacterial cell, more preferred a Bacillus. cell, most preferred a 
5 Bacillus subtilis cell- . - '• "■ 

The DNA sequences to be recombined may be isolated/cloned 
from organisms containing the sequences in a manner known per se. 
For instance, ; a suitable oligonucleotide probe may be prepared 
on the basis of a first DNA sequence of interest in order to 
0 obtain a second DNA ' sequence homologous, to the first sequence. 
Alternatively, the DNA ; sequence can be amplified by PCR based o 
primers designed according .to knowledge of conserved regions . 

I different embodiments of the ' present content is used ; 
the term- "a set of homologous . DNA ..sequences . ( "set A" "set B M ,\ , 
5 "setC" : etc. ) " , e.g. in .relation- to an embodiment ..of .the 
. invention relating to a process according to the invention.,, 
'wherein the homologous DNA sequences are comprised of two or. 
more different set of : homologous DNA sequences ("set A" "set B" 
"setC" etc . ) , •. and wherein each, of said set of homologous 
0 sequences ("set A" "set B", "setC^ etc. )_ are inserted in !a ■ 
• different. DNA construct'. (."DNA construct A", "DNA. , construct B" , • ' 
•"DNA construct' C" etc..) ^"according" to .the invention.. ~\ . - 

This term is used to illustrate that an in vivo 
'recombination process .of the present invention may .'be used' to 
5 ' r'eco™bine numerous: ".sets" of .homologous DNA sequences., Each' / 
individual "set." of homologous ;DNA. sequencers) may individually 
comprise, from only one to numerous .(e.g. a "set" .comprising a 
DNA library) individual, homologous' DNA sequence (s) . • - \ . 

Each .of said "sets" of .homologous DNA sequences : ( "set A' 
cutset B" / "setC"' etc.). may "then be inserted ' in a different DNA : ; ' 
construct ("DNA construct A", "DNA construct B" , , "DNA construct! 
C" etc.) according to the invention. Each of said different DNA 
construct are preferably different on important parameters, 
according to the invention, such as different origin of 
s replication, different selectable markers etc. 

All of said DNA constructs. ("DNA construct A", "DNA 
construct B", "DNA construct C" etc..) may then be transferred 
" into a cell, according to the invention, and when performing a 
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process for in vivo recombination of .the invention, then each "of 
the individual DNA constructs ("DNA construct A", "DNA construct 
B", "DNA construct C" etc..) may then recombine randomly 
according to the invention. After a few repetitions of a 
5 process, of the invention, this will give a DNA population, in 
the cell population, comprising numerous different, combinations 
of different recombined DNA sequences, recombined randomly 
between all "sets' 1 of homologous DNA sequences used in the 
process. 1 

10 Further, the term "a set of homologous DNA sequences 

("set A". "set B", "setC" etc.)*' is used herein- to illustrate one 
of the advantages of an in vivo recombination process of the 
invention, . which is the flexibility of the system to recombine 
.both set of homologous DNA sequences comprising one, few and/or 

15 numerous (e.g. big DNA libraries) of individual homologous DNA 
sequences. 

Accordingly, an embodiment of the invention relates to;-> 
•process according to the invention, wherein said sets of 
homologous DNA sequences each only contain one DNA sequence 'w 

20. (i.e. recombination of two or more homologous sequences). 

Further in embodiments of the invention the. homologous^' 
DNA sequences to' be recombined are derived from a pre-made DNA: . 
library and/.or derived from individually different pre-made 
libraries. < .*£. 

25 Homologous DNA derived "from such DNA libraries are' 

inserted into a vector according to the ' invention (i.e. DNA re- 
presenting the DNA library is ligated (inserted) into a vector). 
As described herein the process of the . invention comprise at 
least two DNA structures (e.g. vectors.) ,'■ which are different in 

30- a way which allow forced in vivo recombination of said vectors. 
Homologous DNA representing DNA libraries may be 
inserted into only one and/or. both of said different vectors . 

I.e. if for instance homologous DNA representing- one DNA 
library is inserted (ligated) into one of said different vectors 

35 and homologous DNA representing another DNA library is inserted 
into the other vector this will allow fast and efficient 
recombination between two different libraries by an in vivo 
recombination process according to the invention. To illustrate 
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this concept, DNA representing one library may be inserted .into 
' a pTR vector (see figure 1) and DNA representing another library 
may be inserted into a pTS vector (see figure 1) . Just' a few re- 
peating in vivo recombination events performed by a process- ac- 
s cording to the invention will then give a cell population 
representing a new library which is a combination of the two 
libraries mentioned above. ' 

',. Accordingly, embodiments ..of the invention relates to a 
process according to the invention', wherein at least one of the 
10 sets 'of; homologous DNA sequences- is derived from a pr e-made DNA 
;.; library./ and at least, one of said'-sets of homologous DNA • 
sequences is v a. DNA sequence/ which is homologous to said DNA 
library (i.e. resulting in a process for in vivo recombination-, 
of said DNA library to a DNA sequence -which are .homologous' to 

is said DNA library).; or -'\ -,' 

_ a process according to ■ thV"' invention, wherein the- sets 
. of homologous DNA sequences is derived from . the same pre-'nade • 
.. DNA library (i.e.. resulting in a process for in vivo 

recombination of said DNA, library); or - • 

■ "-• . -a process according to 'the. invention, wherein the sets 
; of homologous' DNA sequences are derived from- atfleast "two / ■ ■ 
• different' pre-made DNA libraries'^ (i . e . resulting in a process ■ 
' 'for in. vivo recombination of at least two' different DNA' . " 
. ' libraries) ; '.- . ' .. '' . '. ■' 

, 25 . .. ■ In Order to .obtain, "even further recombination between the 
DNA sequences if may be preferred to perform ah' isolation of the- 

"; .'DNA structures after .a first round of. "in- .vivo recombination of-. ' 
said DNA sequences, .performed according, to • the' invention '.. Said - 
isolated. structures are • then re-trans formed into the cells, and a 

30'.-' further process round of in vivo recombination is then ' : 

performed.- . ' ' 

"The advantages of this step is illustrated by following 
example: • * 

Set A is a vector A wherein' DNA sequences a, b are 
35 introduced individually in said vector A (i.e. the vector A 

population comprises a vector with DNA sequence a, and a vector 
with sequence b) ; and 
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Set B is a vector B wherein DNA sequences c, d are- 
introduced individually in said vector B (i.e. the vector B 
population comprises a vector with DNA sequence c, and a vector 
with sequence d) . 
5 Vector A and B are then transformed into a cell 

population and in vivo recombined according to the invention. 
This will give recombination between DNA sequences ac, ad, be, 
bd. The vectors A, 'and B comprising said recombined sequences is 
isolated, and re-transformed into said. cell population. In vivo 

10 recombination according to the invention will then give all 
possible' recombination of said four (a,b,c,d) DNA sequences, 
such as recombination of abc, abdc, etc. 

•In even further embodiments the invention relates to a 
process according to the invention, wherein two sets of homolo- 

15 gous DNA sequences ("set- A" and "set B") are recombined in vivo, 
comprising the steps - 

a ) inserting sequence(s) from "set A" in a vector contairji- 

ing a temperature resistant origin of replication and^a 
• - . ■ -? 

gene for a first marker; v 

20 b) inserting sequence (s) from "set B" in a vector contain-' 

ing a temperature sensitive origin of replication and a 

gene for a second marker; 

c) transferring the- vectors into a cell; 

$y * incubating the cell at a temperature where both^ origins 
25 of replication are functional, optionally under a selec- 

tion pressure that favours cells with both marker genes; 
e) ' shifting the temperature 'to a temperature, where only 

the temperature resistant origin of replication is func- 
■ tional ; . * ■ 

30 f) . repeating steps d)-e) at least once; 

or ' 

' - a process according" ..to rhe ■ invention, wherein two' sets 
35 of homologous DNA sequences ("set A" and "set.B;") are recombined 
in vivo, comprising the steps 
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a) • . inserting sequence (s) from "set A" in a vector contain- 

ing a temperature resistant origin of replication and a. 
.gene for a first, marker; 

b) "inserting sequence (s) from "set B" in a vector contain- 
5 ing a temperature sensitive origin of replication and a 

* < .gene for a second marker; 

c) transferring the 1 vectors into a cell; 

d) Incubating the cell at a temperature where only the tem- 
perature resistant origin of replication is functional,. 

10 optionally under a. selection pressure .that favours cells. 

with both marker genes; ' '* ^ 

■ e) 'optionally, shifting the temperature to a temperature, 

where both origins of 'replication; are functional ; * 
f)- repeating steps d ) - a ) at least. once. . 

is . ' : L < • ' ■ / • • . " 

In a further aspect the invention relates to- a^ method for 
producing' one' or more< recombinant prdtein(s) having' a desired 
biological activity comprising: 

20 a). performing a process for in vivo recombination of homo- 

logous DNA sequences . ( the process is perf ormed according 
to the invention as described herein) wherein at least 
■ one of the DNA structure ( s) is an expression' vector; 
b)~ -expressing the'- numerous different recombinant polypeptides 

25 t ' " .encoded by the numerous .different recombihed 'DNA sequences. 

, ' ■ ' from, step a) ; and . . v " \ • "' 
^ c) screen or select ' the numerous different recombinant 

proteins from. step b) : in a suitable screening or selection , 
,-- - system for one ' or more recombinant polypeptides (s) having 
30 a desired activity. . >■ ' * 

Preferably, the expression vector in step a) above is a DNA 
structure which is unable to replicate (e.g. having a non- 
functional origin of replication) during a forced hybrid DNA 
35 structure formation (i.e. integration of said structures into 
each other) performed according to a process for in vivo 
recombination of homologous DNA sequences according to the 

i, 

invention. This will improve the probability that the expression 



WO 98/28416 



19 



PCT7I)K$7/00567 



vector, after a crossing-out event of said hybrid-structure 
according to the invention, comprise recombined homologous DNA, 
since the only way said expression vector may have survived the 
process of forced hybrid formation is by- actually recombine with 
5 another DNA structure, which is able to replicate under the 
conditions used to force said hybrid-structure formation. 

After the process for in vivo recombination have been 
performed (step a) above) , and before the expression and 
screening steps b) and c) above, it may- be advantageously to 

10 make a* cell population mainly comprising only the expression 
vector (i.e. other DNA construct (s) are partly or nearly 
completely removed from the cell .population) . This is particular 
advantageously when the expression vector is unable to replicate 
during the forced hybrid DNA structure formation as described 

15 immediately above, since such a cell population will have a very 
limited background of non-reconbined DNA sequences of interest, 
which is advantageously in the subsequent screening step (ste'p 
c) above) . 

Making a cell population mainly .comprising only the 
20 expression vector nay be. performed according to standard pro-- 
cedures known in the art,, such as growing the cell population" at 
conditions selectively favouring replication of the expression 
vector, as compared to the other DNA structures; or selectively 
- purify said expression vector form the* cells and subsequently" 
25 transfer this into a new cell population. 

. Expression the . recombinant polypeptides encoded by the 
recombined 'sequence in step a) above may be performed by use of 
standard expression vectors, and/or a standard expression vector 
may be modified in order to comprise elements specially suitable 
30 for a in -vivo recombination process according to the invention. . 

A suitable screening or selection system, to screen or se- 
. lect the numerous different recombinant proteins from step ,b) ' 
above; .will depend on the desired biological activity,. 

\ A number of suitable screening or selection systems to 
35 screen or select for a desired biological .activity are described 
in the art. Examples are: 
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Strauberg et al. (Biotechnology 13: 669-673 (1995), which 
describe a screening system to screen for Subtilisin variants hav- 
ing a Calcium-independent stability; 

Bryan et al. (Proteins 1: 326-334 ,(1986) ), which describe a 
5 screening assay to screen for Protease having a enhanced thermal 
stability; and 

PCT-DK96/00322 which describe a screening assay to screen 
for lipases which are having an improved wash performance in wash- 
ing detergents. 

10 A preferred embodiment of the invention comprise screening 

or selection of recombinant protein '(s) , wherein the desired bio- 
logical activity is improved performance in dish-washing or laun- 
dry detergents. Examples of suitable dish-wash cr laundry deter- 
gent are disclosed in PCT/DK96/00322 and WO 95/30011. : 

15 The"' invention also comprises a set . (.at least two) of 

vectors, comprising at least two different origins- of replica- 
tion able to function - in the sane, cell under permissive con- 
ditions, and at. least two homologous DNA sequences encoding 
polypeptides. In, a preferred '.embodiment ,. ■ at least one origin of 

20 replication is temperature sensitive*. ■. The vector may preferable 
" be -a plasmid . '. - •' 
The invention 'further comprises a- vector. systen, com- 
prising at least - 

a) . a vector A containing a DNA sequence encoding a polypep- 
25 tide and an origin of replication, and 

b) a' vector B containing' a DNA sequence homologous to the 
DNA sequence in vector A," and an origin of replication 
that differs from the origin of replication in .vector A. 

• The vectors may preferable be plasmids. 

30 * ' ' / . '•' • " . ' '• , 

It is preferred that the DNA sequence encodes polypep- 
tides with a biological activity, especially, enzymes and in par- 
ticular enzymes such as an amylase, lipase, cutinase, cellulase, 
oxidase, phytase, and a protease. 
'35 The invention further comprises a new, recombined DNA 

sequence, characterised in that has be produced by a .process ac 
cording to the invention, or it is encoding a recombinant pro- 
" ' tein having a desired biological activity, wherein said recombi 
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nant protein having a desired biological is produced , by the 
method according to method for producing one or. more recombinant 
protein (s) having a desired biological activity according to the 
invention. 

5 The invention further comprises a process for producing 

a polypeptide , especially an enzyme, comprising i) growing a 
cell expressing a polypeptide encoded by a recombined DNA se- 
quence according to claim 38, and ii) purifying said expressed 
polypeptide. 

10 The invention further comprises a new polypeptide, espe- 

cially an enzyme, characterised in that it is produced by a 
process according to the invention. 

The invention further comprises a. polypeptide, espe- 
cially an enzyme, -characterised in that it is encoded of a re- 
15 combined DMA sequence according to. the invention. 

Polypeptides of the present invention include .peptides and 
proteins, and optionally glycosylated forms thereof. Examples are,, 
. enzymes, pharmaceutical^ active polypeptides such as analogues,; 
of human or mammal polypeptides, e.g. insulin, growth hormone, , r 
20 human hormones or growth regulators. 

The invention is • described in further detail in the 
following examples which are not in any way intended to limit the.. 
■ scope of the invention. 

25 EXAMPLES 

MATERIALS AND' METHODS 

Strains . 

30 E.coli: SJ2 (Diderichsen,, B., Wedsted, U. , Hedegaard, L. , 

Jensen, B.:..R., Sjoholm, C. (1990) Cloning of 'aldB, which encodes 
alpha-acetolactate decarboxylase , an exoenzyme from Bacillus 
brevis. J. Bacteriol . , 172, 4315--4321) .Electrocompetent cells 
prepared and transformed using a Bio-Rad GenePulser™ as 

35 recommended by the manufacturer. 

B.subtilis DN1885 Diderichsen, B. , Wedsted, U. , Hedegaard, L. , 
Jensen, . .B. R. , S.joholm, c. (1990) Cloning of aldB, which encodes 
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alpha-acetoljactate decarboxylase, an exoenzyme from Bacillus 
brevls. ■ J; \Bacteriol. , 172 , . 4315-4321) . Competent cells were pre 
pared and transformed as described by Yashin, ;R.E. , Wilson, G.A. 
and! Young/ F.E*. (1975) Transformation and transfection in ly- 
5 ,'sogenic strains of Bacillus subtilis : evidence for selective in- 
duction of prophage i in competent cells. * J. Bacteriol/ 121:29 6-304 . 

B. subtilis PL1801.. This strain is the B. subtilis DN1885 with , 
disrupted apr and npr genes (Diderichsen B,, Wedsted, U. , / 

10 ■ Hedegaard, L. ,. Jensen, B . R. > Sjoholm, p ( 1 9 9 0 ) C 1 on ing of ] , 

v aldB, which 'encodes alpha-acetolactate decarboxylase/- an 

exoenzyme from Bacillus brevis.- JV Bacteriol., 172, 4315-4321). 

Plasmids : ' ^ . 

15 * pSX120": B . subtilis expression vector 1 (Described in EP 405901,, 
and WO 91/00345) . This vector " comprise .the wild-type Savinase ' 

gene . • ' ; _ " >' ' \ \ ' - ' ' - ; 

pE194.:, . (Horinouchi, S and Weisblum, 3. , 1982, J I Bacteriol .. 

: * 150 :804-814)', ' '■ \. Y ', 

20 pSX222 : B . subtilis .expression vector (Described in WO : 9 6/34946 , 
and WO 92/19729) . This ' vector comprise the synthetic Savinase . 
gene herein' called Sav.isyn . In the Sav.isyn gene is introduced a- 
• " large number of restriction sites as compared to the wild-type 

, : '"Savinase .gene' comprised in pSX12 0 . x . ... " " . ; 

25-'. ' ■ . '• ' .. ' 1 ■ • • : ■ ' ■ ■' .." ' . t t ; ■ " */ 

pUC19 : Yanisch-Perron/ C. , Vieria, JV and Messing/ J. (1985) " . 
Gene 33 , 103-119 . ' • ■ ;. - > ■ 

pF64L-S65T-GFP: is the- E.coli plasmid encoding the DNA, fragment 
30 encoding the F64L-S65T-GFP described in WO 97/1109 4 . / . • 

pMUTIN-4-MCS: Plasmid/can be obtained from Laboratbire de Genet'i- 
\ qu e Microbienne, -Institut National de la Recherche Agronomique , ■ 
78352. Jouy en Josas - CEDEX, France. 

35 

Solutions /Media 
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TY and LB agar (as described in Ausubel, F. M. et al; 
(eds.) "Current protocols in Molecular Biology"; John Wiley and 
Sons, 1995) . 

LBPG is LB agar supplemented with 0.5% Glucose arid 0.05 M 
.5 potassium phosphate, pH 7.0 ' 

LBPGSK plates are LBPG media with agar and 1% of skimmed 
milk. , 

Antibiotics are used in different medias in concentrations 
of 6 |ig/mi Chloramphenicol, 10 |ig/ml Kanamycine and 5 ug/ml og 
10 Erythromycine. 

TE-buffer (as described in Ausubel, F. M. et" al. (eds.) 
"Current protocols in Molecular Biology". John Wiley and Sons, 
1995) . ~ 

15 General molecular biology methods 

DMA manipulations and transformations were performed using' 
-standard methods of molecular biology (Sambrook et al. (1989) 
Molecular cloning: A laboratory manual, Cold Spring Harbor lab.;' 
Cold Spring. Harbor, NY; Ausubel, F. K. et al.- (eds.) "Current 
20 protocols in Molecular Biology". John Wiley and Sons, 1995; 

Harwpod, C. R.., and Cutting, S. K. (eds.) "Molecular Biological 1 ' 
. . Methods for Bacillus" . John v;iley and "Sons, 1990). 

• ■ Enzymes for DNA manipulations, were used according to the 
specifications of the suppliers. ' 
25 • " - • 

Enzymes "for" DNA 'manipulations - ■ 

Unless otherwise mentioned all enzymes for DNA ; 
manipulations, such as e.g. restriction endonucleases , ligases, 
polymerases, PCH polymerases etc., are obtained from New England- 
30 Biolabs , Inc. - 

Construction of the temperature sensitive shuffling olasmid 

pMB43 0 was constructed in 'a .five step cloning- process as . 
depitched in the following, the entire DNA sequence is '-found. in 
35 SEQ ID No. 1,., 
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The temperature sensitive plasmid pE194 was modified by intro- 
ducing multiple , cloning sites around the unique Clal site of the 
pE194 plasmid and thus destroying the Clal. site. This introduc- 
tion was done by PCR using the primers #22899 and #24039 to am- 
s plify the whole plasmid and introducing several .unique cloning 
; sites. 

The PCR was performed as follows: 

1* imI of pE194 Qiaquick plasmid prep was used for 50 /il L 
PCR amplification in PCR buffer (10 mM Tris-HCl, pH 8.3., 50 mM 
ip KC1, 1.5 mM MgCl 2 , 0.01 % (w/v) gelatin) containing 200 /iM ofv 
\each dNTP, 2.5 units of AmpliTaq polymerase. (Perkin-Elnier / 
Cetus> USA) and 100 pmol of each -primer: . 
#22899 * 

5 1 -GCAGCTGGATCCGCGGCCGCGAATTCGTCTTTTGCGCAGTCG'GC-3 ' 

15 ' « 

• • #24039 

\ , 5 1 -GCAGCTGGATCCGGGCCCGGGAAGCTTCGATTCACAAAAAATAGGCACACG-3 ' 

The PCR reactions were performed using a DNA thermal 
20 cycler (Landgraf, Germany). One incubation at 94°C f or 1, min 
/•followed by forty cycles of PCR performed using a cycle profile, 
of denaturation at 94°C. for 1 'min., annealing at 55°C for 1 min, ■ 
and extension at 72°C for, 2 min; Five-^1 aliguots of the- ampli- 
fication product was, analyzed by electrophoresis in 0,7 % ■ 
2 5 agarose gels (Nu Sieve, FMC)'. " • : 

', Subcloning of PCR .fragment " ■ 

Forty-f ive-/il .alaquots of the PCR products generated as 
described above were, purified using QIAquick PCR purification. 

30 kit (Qiagen, USA) according to the ' manufacturer 's Instructions. 
The purified DNA was eluted in 50 fil of 10-mM Tris-HCl, pH 8.5. 
Fifty -/il -of the purified PCR fragment was digested with BamHI, 
electrophoresed in 0.8 % low gelling temperature agarose 
(SeaPlaque GTG , FMC) gels, the relevant fragment was excised 

35' from the gels, and purified using QIAquick Gel extraction Kit 
(Qiagen, USA) according to the manufacturer's instructions. The 
isolated DNA fragment was then ligated to BamHI digested pUC19 
and the ligation mixture was used to transform E. coli SJ2 . 
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Cells were plated on LB agar plates containing Erythro- 
mycin (200 /xg/ttl) supplemented with X-gal (SIGMA, USA) (5-Bromo- 
4-chloro-3-indolyl alpha-D-galactopyranoside, 50 /ig/ml) . 

5 identification and characterisation of positive clones 

The transformed cells were plated on LB agar plates 
containing Erythromycin (200 /ig/ml) supplemented with X-gal (50 
/ig/ml) and incubated at 37°C over night. Next day white colonies 
were rescued by restreaking these onto fresh LB- Erythromycin 

10 agar plates and incubated at 3 7°C over night. The next day 
single colonies of each clone were transferred to liquid LB 
medium containing Erythromycin (200* /ig/ml) and incubated 
overnight at.37°C with shaking at 250 rpm. 

Plasmids were extracted from the liquid cultures using 

15 QIAgen Plasmid Purification mini kit (Qiagen, USA) according to 
the manufacturer's instructions. Five-^1 samples of the plasmids 
were digested with BamHI. The digestion were checked by s;£V , 
gelelectrophoresis on a 0.7 % agarose gel (NuSieve, FMC) . The 
appearance of two DNA fragments of about 2.7 kb and 3.7 ^ 

'20 indicated a positive clone. The clone was designated MB2S3 and \ 
the plasmid pMB293 . 

2) ■ ' 

PMB293 was then EcoRI digested and the 3.8 kb fragment was 
25 gelpurified on a 0.7 h agarose gel and using QiaQuick gelpurifi- 
cation..kit from. Qiagen according to the manufacturer. This frag- 
ment which constitutes the pE194 with inserted multiple cloning- 
sites (originating from thePCR primers and from MCS of pUC19) 
was circulised by ligation and used to transform. B . subtilis 
30 DN1885, transformed cells were selected on LBPG plates, contain- 
ing 5 /ig/ml" Erythromycin and incubation was done at 33 °C over- 
night. Clones from this transformation were restreaked and cul- 
tivated overnight in TY.with 5 ug/ml Erythromycin. From over- 
night* culture broth cells were isolated and plasmids were puri- 
35 fied using Qiaquick plasmid kit. Restriction enzyme digestion of 
the plasmids verified the correctness of the clones. One such 
clone and plasmid were termed MB333 an pMB333, 
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The pMB33 3 was then used as the temperature sensitive plasmid 
for the construction of a plasmid containing the the Sayizyn 
gene. without any promoter. This construct was made as follows. 

, PCR using the primers #21547 ^ and ;,#2 154*8 and the pSX222 as 
'i the template was performed according to these conditions: 

1 fil of pSX22 2 Qiaquick plasmid prep was used for 50 /il 
10 ..PCR. amplif ication in PCR buffer (10, inM Tr i s -HC 1 , . pH . 8 . 3 , « 50 mM 
KC1, 1.5 mM MgCl 2/ 0.01 % (w/v) gelatin) containing 200 pM of • 
each dNTP, 2.5 units of AmpliTaq polymerase (Perkin-Elner , . 
Cetus, USA) and 100 pmol of each primer: • .' 

#21547 • t ';. ",' ' . / . : . ■ ' ' 

15 5 1 ~GCAGCTGCAGGATCCGAATTCGCGGCCGCGCAATCGGTACCATGG--3' ' 

#21548' / , .. - 4 " s ' • 

.5 .' -GCAGCGAGCTCAAGCTTCCCGGGCCCAGCCGGTCGACCGCGTTGCCGCTTCTGCG--3 '•' 

,. ■ - *' ; . The PGR reactions .was performed using a - DNA .thermal 
20 cycler (Landgraf, Germany) . One incubation' at 94°C for ;l min 

followed by forty cycles of: PGR performed using a * cycle profile ~ 
- : of denaturatipn at. 9 4°C for 1 min-. annealing at 55°C for 1 min, , 
' • and extension at 72 °C for 2 min'; -Five-Mi aliquots of the ampli- , 

fication '.product was analyzed by, electrophoresis in 0.7';% , ■" : 
25 agarose gels ,( Nu Sieve-, FMC) \ '•. : " ^ ' ; . ' 1 ' 

■■■■>•. Subclonincr of PCR f xaqm'ent / . - ' •, ' • ' ' 

: . . : -Fortyf ive-^1 aliquots of the 1 PCR products generated as. 

described above were purified using QIAquick P.CR purification 
'30 kit " (Qiagen, USA) ' according to the : manuf acturer ' s instructions'.." . 
The purified; DNA was eluted in 50 >1 of lOmM Tris-HCl r pH/8.5. 
Fifty, -jLil of the purified .PCR. fragment was digested with BamHI 
and Apal, electrophoresed in 0.8 % low gelling temperature 
agarose (SeaPlaque GTG , FMC) gels, the relevant fragment was 
35 excised from the gels, and purified using QIAquick Gel 

extraction Kit (Qiagen, USA) according J to the. manufacturer ' s 
instructions. Purified fragment was ligated to gelpurified BamHI 
and Apal digested pMB333. The ligation was used to transform 
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B.subtilis DN1885. Transformed cells .were' plated on LBPG agar 
plates. containing 5 /xg/ral Erythromycin and incubated overnight 
at 33°C. Clones were restreaked on similar plates and plasmids 
were isolated from overnight cultures of clones incubated at 
5 33 °C in TY with 5 iiq/nl Erytrhomycin plasmids were isolated 
using Qiaguick plasmid kit as described by manufacturer. 
Plasmids were checked by enzymatic digestion using Apal and 
BamHI. One correct clone was termed MB3 39 and the plasmid 
pMB339. 

10 

4) 

We wanted to test an easy way of .monitoring that hybrid plasmids 
had formed. Thus we wanted to monitor the forced recombination 
between the two homologous genes on the two seperate plasmids. 

is This was done by constructing a pE194 derivative having a pro- 
moter-lacking Savizyn transcriptionally fused to an ORF encod- 
ing GFP . In this way there would be no expression of GFP (duetto 
lack of promoter upstream of the Savizyn and thus the GFP gene) 
unless the two homologous genes had recombined and thus getting 

20 the GFP under control of the promoter otherwise present on - 
pSX120. The plasmid was constructed by ' PCR- amplifying the GFP '' 
gene and introducing it just downstream of the savizyn' gene on 
the DM3339 . 

The GFP encoding DNA used herein- is* a derivative of" 
25 the original wildtype gene, the derivative of GFP was cloned 
-■ from the DMA construction, of the. mutant F64L-S65T-GFP which was 
constructed as described in international patent application WO 
97/11094. 

The construction of the template plasmid was de- 
30 scribed in WO 97/11094. The DNA fragment encoding the F64L-S65T- 
GF? was PCR amplifyed from the E.coli plasmid encoding this gene 
as follows: . . " ' • 

1 pi of pF64L-S65T-GF? Qiaquick plasmid prep was used 
for -50 pi PCR amplification in PCR buffer (10 mM Tris-HCl, pH 
35 8.3, 50 mN KCl, 1.5 mM MgClo, 0.01 %" (w/v) gelatin) containing 
200 /iM of each dNTP, 2.5 units of AmpliTaq polymerase (Per- 
kin-Elmer, Cetus, USA) and 100 pmol of 'each primer: 
#20231 
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5 A-AACTGCAGAAGATGTGGACGCGC-3 1 

#101381 • . . . ' > 

> 5 ' -GCCC^TGCATAAACTGCATCCCTTAACTTGTTTTATTTGTATAGTTCATCCATGCCATG- 

■ 3 1 ■ • - - '/ . ■". ■' ' ' . - ' ' 

5 . : j; . - ' /, ■ ■ ;/ ■■ 

1 v r The PGR reactions was performed using a DNA thermal 

cycler (Landgraf , - Germany) . - One incubation at 94°C for 1 min 
followed by forty cycles. of PCR performed using a cycle profile 
of denaturation at ?4°C for 1 min, annealing at! 55°C for 1 min, 
10.. and extension at 72, °C for 2 min. Five-/il aliguots of the ampli- 
fication product was analyzed by electrophoresis in 0.7 % 
agarose gels (NuSieve, FMC) . : 

• Subclonina of PCR fragment . 

15 -Forty-f ive-/il aliquots of. the PCR products generated as 

described above were purified using QIAquick PCR purification 
kit (Qiagen, USA) according to the manufacturer ' s instructions. 
The. purified DNA was eluted in 50 ^1 , of . lOmH fris-HCl, pH ;8 . 5'. 
The PCR fragment and pMB33 9 were "digested with Clal and Nsil, . . 

20 gelpurif ied as above, and Migated together.. The ligation mixture 
was used, to transform DN1885, transformed cells were plated on 
LBPG 5 fig/ml Erythromycin. After 18 hours of incubation green 
f lour.escent cells were visible as single colonies/ One of .these 
were analysed' and 1 saved as KB406 and the corresponding plasmid as- 

25 pM34 06. Since th.e GFP encoding GRF did not have an upstream pro- - 
moter driving the transcription a possible explanation for the GF? ' 
/ expression could be a read through from a promoter 'further up- 

V- straem of the Savizyn-GFP . transcriptional .fusion , namely- the pro- > 
moter of the repF gene. ' . \ 

30 . ■ . . * . ' , " k ' . ' •' • • . " 

In order two abolish any read-through into the ORF of GFP, a tran- 
scriptional terminator was inserted between the repF gene and the 
Savizyn DNA. The terminator was amplified from the pMUTIN-4-MCS 
35 plasmid using the following primers and conditions: 

1 /xl of pMUTIN-4-MCS Qiaquick plasmid prep was used for 50 \l\ 
PCR amplification in PCR buffer (10 mM Tris-HCl, pH 8.3, 50 mM 
KC1, 1.5 mM MgCl 2 , 0.01 % (w/v) gelatin) containing 200 m of 
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each dNTP , 2.5 units of AmpliTaq polymerase (Perkin-Elmer , 
Cetus, USA) and' 100 pmol of each primer: 
#102588 ' 
5 1 -CGGACGGTACCGGTAATGACTCTCTAGCTTGAGGC-3 1 
5 #102589 

5 1 -GGGTACAGATCTCCGCGGCGCAAAAAGGCCATCCGTCAGG-3 1 

The PCR reactions was performed using a DNA thermal cycler 
(Landgraf, Germany) . One incubation at 94°C for 60 sec followed 
by forty cycles of PCR performed using a cycle prof ile of dena- 
10 turatioh at 94°C for 30 sec, annealing at 55°C for 30 min, and 
extension at 72 °C for 1 min. Five-^tl aliquots of the ampli- 
fication product was analyzed by electrophoresis in 0.7 % aga- 
rose gels (NuSieve, FMC) . 

The amplified DNA was purified as above and cloned as a Kpnl- 
15 Bglll fragment in pMB406 also Bglll.and Kpnl digested (same proce- 
dure for gelpurifying etc. as decribed above). The ligation mix-tfr 
ture was used to transform DN18S5, transformed cells were plated $t 
on LBPG 5 Mg/mi Erythromycin. After 18 hours of incubation cells' 1 - 
were visible as single colonies. One of these were ananlysed and"^ 
20 saved as MB430 and correspondingly the plasmid pMB4-30. This' clone 
had absolutely no expression -of G-FP indicating that the cloned 
-terminator in fact worked as anticipated. 

The resulting sequence of the entire plasmid pMB430 is listed as'" 
SEQ NO 1. . ■ ' 

25 • • 

Establishing the MB 4 3 2 strain ' 

PL1801 Bacillus subtilis was transformed ■ with the two plasmids 
pMB430 and pSX120 using approximately 1 jig 'of each plasmid. per 100 
Ml of competent PL1S01. The transformed cells were plated onto 

30. LBPG-5 Mg/ m l Erythromycin, incubated overnight at. 33 °C. Next day 
colonies were- restreaked onto L3PG-5 pg/ml Erythromycin +. 6 /xg/ml. 
Chloramphenicol and incubated overnight at 33°C. Clones were also 
grown i liquid* culture TY- 5 pg/r-.l Erythromycin .+ 6 fig/nl Chloram- 
phenicol overnight at 33°C, plasmids were purified, from culture 

35 broth and the existence of both pMB430 and pSX120 in .the PL1801 
strain was verified. This clone was saved and termed MB432. 
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EXAMPLE 1 . - 

Sequence shuffling of two homologous sequences both coding for 
the subtilisin gene of Savinase 

5 The Savinase vt gene (comprised .in pSX120) coding for a 

protease of the subtilisin type was chosen as the; target for 
' shuffling of highly homologous genes. A site-directed 1 variant of 
the Savinase gene (hereafter called Savizyn, comprised, in 
pSX22 2, where a large number of silent mutations giving new re- 
ap strict ion s i tes are introduced, was used as the shuffling part- 
ner to Savinase. The two genes were cloned on : two separate com- 
patible plasmids according to the invention. A transcriptionally 
active Savinase gene was cloned on a plasmid based on the pUBHO 
origin coding for resistance to chloramphenicol (pSX120, fig. 
15'' 4). A promoter less Savizyn gene was cloned on a derivative of 
' the temperature sensitive plasmid pE194 which also codes for re- 
sistance to erythromycin (pMB430) . Two transcription terminators 
were cloned upstream of. the .Sav.i zyn. gene to prevent any read- 
through. 

20 ' In pMB430 the green florescence protein (GFP) was tran- 

scriptionally fused to the Savizyn gene in order to be able to 
. visually ' follow the in-cross and out-cross events between /the/ 
. two plasmids .• . . 

■B.'subtilis strain K3432, pLISOl transformed with the two 

25 plasmids pSXl-20 . and " p>S34 30 , were taken through two cycles, of • 
temperature shifts to induce two ' consecutive double-cross over 
events between the homologous genes. Homologous .recombination 
•and plasmid hybrid formation was selected on LBPG plates at^ the 
restrictive temperature (47-50°C) in the presence of erythromy- 

30 'cin. Under these restrictive conditions, the temperature sensi- 
tive plasmid pMB430 is forced into the stable pSX120 plasmid by 
homologous recombination in the region of Savinase/Savizyn . 
These colonies where recombination has occurred- also becomes 
fluorescent as. a result of the recombination between the two 

35 plasmids and concomitant activation of the Savizyn-GFP operon 
(fig. 2) . Out-cross events were screened by growing the clones 
in LB-broth at the permissive temperature of 30°C for two days 
and plating the cells on LBPG plate with chloramphenicol and 
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erythromycin. The non-fluorescent colonies were restreaked on 
plates and taken through a second round of temperature cycle as 
described above. 

After the second round of shuffling, plasmid preparations 
5 were made from 2 0 different clones. Competent PL18 01 was trans- 
formed with each of the 2 0 plasmid preps and plated on LBPG sup- 
plemented with erythromycin. The plates were replicated onto' 
LBPG plates with chloramphenicol and erythromycin to identify 
unwanted clones where both pSX120 and pMB4 3 0 had been trans- 

10 formed. Clones where only the pMB430 plasmid had been trans- 
formed were selected for sequencing of the Savizyn region to 
identify if shuffling had taken place. It is evident from figure 
6 that recombination between the two genes Savinase and Savizyn 
had taken place. All of the 12 sequenced clones show some degree 

15 of recombination and 4 of these can positively be- identified • as 
•being result* of two consecutive double recombination events. It 
is important to note that exchange of Savinase and Savizyn se- 
quence can only be observed* if the shuffled fragment overlaps'^ 
restriction site that is unique for Savizyn. The four clones 

20 with' a positive evidence for two double recombinations is there- 
fore the minimum. , number since the recombination could have oc- „ 
curred between two neighbouring restriction sites. 
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(2) INFORMATION • FOR SEQ ID NO: 1: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: .5313 base pairs 

(B) TYPE: nucleic acid . > .. " 
: (C) STRANDEDNESS:; single'. /' 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA , ; - " ; ; 

(vi) ORIGINAL SOURCE: ' , . ' 

( / (A) ORGANISM: Plasm id pMB4 3 0 

(XI ) SEQUENCE -DESCRIPTION : SEQ ID NO : ' 1.: ■ 

GATCAGTACA AGAAAGATAC TGTA7TTCAT ' AAACAGCAAC. TCCAAGAAGT TAAGGATGAC '60.' 

TTACAGAAGG GAAATAAGCA GT7ACAGACT G G AA 7 AG AG C. ATATGAGCTC TACGAAACCC 12 0 

TfTGATTATG) P AAAATGAGCG'.' TACACCTTTG "TTCTGT.GG AC; GTCAAGAGAC 7GCTAGAAAC' . ' .180 

'ATATTAACTG • CTCATGAATT TGAACCCC.7G CAA.GAAAC.--A TCTCT.TCTGC ' AGAACGGA77- r 240 
_ GTTGATGATT ACGAAAATAT TAACAGCACA. GACTATTACA CAGAAAATCA- AGAATTAAAA '" 300 

AAA CG TAG' AG" ■ AG AGTTTG AA AGAAGTAGTG AATACATGGA AAGAGGCGTA TCACGAAAAA . • ... 360 

' AGTAAAGAGG • TTAATAAATT AAAGCGAGAG -AATGA7AG77 TGAATGAGCA GTTGAATGTA 4'20 

TCAGAGAAA7, TTCAAGCTAG TACAGTGACT TTA7.A7CG7G C7GCGAGGGC GAATTTCCC7' . 480 - 

GGGTTTGAGA AAGGGTTTAA TAGGCTTAAA GACAAA77C7 TTAATGATTC CAAATTTGAG. 540 

CGTGTGGGAC. AGTTTATGGA TGTTGTACAG GATAATGTCC AGAAGGTCGA TAGAAAGCGT 600 

GAGAAACAGC GTACAGACGA TTTAGAGATG TAGAGGTACT TTTATGCCGA GAAAACTTTT 660 
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TGCGTGTGAC AGTCCTTAAA ATATACTTAG AGCGTAAGCG AAAGTAGTAG CGACAGCTAT 720 

TAACTTTCGG TTGCAAAGCT CTAGGATTTT TAATGGACGC AG CG CATCAC ACGCAAAAAC- 780 

GAAATTGGAA TAAATGCGAA 'ATTTGAGATG TTAATTAAAG ACCTXTTTGA GGTCTTTTTT 840 

TCTTAGATTT TTGGGGTTAT TTAGGGGAGA AAACATAGGG GGGTACTACG ACCTCCCCCC - 900 

TAGGTGTCCA TTGTCCATTG TCCAAACAAA TAAATAAATA TTGGGTTTTT AATGTTAAAA 960 

GGTTGTTTTT TATGTTAAAG TGAAAAAAAC AGA7G7TGGG' AGGTACAGTG ATGGTTGTAG 1020 

ATAGAAAAGA AGAGAAAAAA GTTGCTGTTA C777AAGAC7 ' TACAACAGAA GAAAATGAGA 1080 

TATTAAATAG AATCAAAGAA AAA7A7AA7A 77ACOAAA7C AGATGCAACC CC7ATTC7AA 1140 
TAAAAAAATA 7CCAAACGAG GAA7ACGG7G CA7777AAAC AAAAAAAGAT AGACAGCAC7 , 1200 

GGCA7CC7GC C7ATC7A7GA. C 7 AAA 77 77 G T7AAG7G7A7 7AGCACCG7T AT7A7A7CA7 1260 

GAGCGAAAA7 G7AA7AAAAG AAAC7G AAAA CAACA-AAAAT 7CAAGAGGAC C7AA77CGAC . 1320 

A777C7777A 7A7CCACAA7 CAGCAAAACC CGAGTGG77A CAG7A77TAA AAGAG77ACA 13S0 

CA77CAA777 G7AG7G7C7C CA77ACA7CA 7AGGGATAC7 GA7ACAGAAG G7AGGA7GAA 144 0 

AAAAGAGCA7 TA7CA7A77C TAG7GA7G7A 7GAGGGTAA7 AAATCT7A7G AACAGA7AAA 1500 

AA7AAT7AAC AGAAGAA77G AA7GCGAC7A 77CCGCAGA7 7GCAGGAAGT GTGAAAGG7C' 1560 

TTGTGAGATA TATGCTTCAC ATGGACGATC CTAA7AAJ\T7 -7AAA7ATCAA AAAGAAGA7A . ' 1620 

TGATAG777A TGGCGGTGTA ' GA7G7TGATG AA77A7TAAA GAAAACAACA ACAGATAGA7 : 1680 

ATAAAT7AA7 TAAAGAAA7G ATTGAGXTTA TTGATGAACA . AGGAATCGTA GAATTTAAGA 1740 

GT7TAA7GGA TTATGCAA7G AAGT7TAAA7 TTGA7GATTG G77CCCGC7T TTATGTGA7A. . 1800 



«,~wi„.W • PCT/DK97/00567 

WO 98/28416 

" < ■. 34. 



ACTCGGCGTA- TGTTATTCAA -GAATATATAA AATCAAATCG GTATAAATCT GACCGXTAGA 1860 
TTTTGAATTT AGGTGTCACA AGACACTCTT TTTTCGCACC- AG CG AAAACT GGTTTAAGCC 1920 
GACTGCGCAA AAGACGAATT CGAGCTCGGT . ACCGGTAAT.G ACT CTCTAGC TTGAGCCATC 1980 
AAATAAAACG. AAAGGCTCAG ' TCGAAAGACT GGGCCTTTCG TTTTATCTGT. •TGTTTGTCGG 2040 
TGAACGCTCT -CCTGAGTAGG ACAAAT.CCGC jCGCTCTAGCT ^OCAG^GG . CCATCCTGAC ./ 2100 
GGA7GGCCTT TTTGCGCCGC GGAGATCTAA ATATTCG7GG TGGCGCAAGC TTTGTACCAG • 2160 
GGGAACCGTC GAC7CAAGAT GGGAATCGGC , ATGGCACCCA ' .TGTGGCCGGG ACGATCGCTC' 2220 • 
CTTTAAACAA TTCGATTGGC GTT.CTTGGCG TAGCGCCGAG CCCTGACC7A: TACGCTGTTA \ 2280 
AAG7CC7AGG GGCGAGCGG7- TCAGGTTCGG 7CACC7CGA7 .TGCCCAAGGA 77GGAATGGG ^ 2340 , 
CnGCGAACAA 7CGCA7GCAC G77GC7AA77 TGAGITTAGG AAGCCC77CG CCAAG7GCCA .2400 
CAC'TCGAG'CA ACC7C77AA7 AGCGCGAC77. CTAGAG&CGT 7C77G77G7A CCGGCA7C7C 2460'; 
GGAA77CAGG i / 7GCACGC7CA A7CAGC7A7C .'CGGCGCGCTA "TCCGAACCCA A7GGCAG7CG .. 2520 

. GAGC7AC7CA. 7GAAAACAAC AACCG.CGCTA . GCT777GACA- GTATGG.CGCX GGCC77GACA • 2580; 

' 77G7CGGACC CGGGG7AAAC C7CCAGAGCA CATACCCAGG 7TCAACA7A7 GCCAGCTTAA. 2 640 
' ACGG7ACA7C GA7GGC7AC7 CC7CA7G77G CAGG7GCGGC CGCCCTTCTT AAACAAAAGA .2700 
■ ACCCATC77G G7CTAA7GTA CAAA77CGAA' A7CA7C7AAA . GAA7ACGGCA AC7AG7TTAG 2760 
GAAGCACGAA C77G7A7GGA AGCGGAC77G 77AACGCAGA AGCGGCAACG CG77AAAAA7 ' 2820 
GAGGAGGGAA GCTTTATGAG TAAAGGAGAA GAAC7777CA CTGGAG77GT CCCAA77C77 2880 
G77GAATTAG ATGGCGATGT TAA7GGGCAA AAA77C7CTG TTAGTGGAGA GGGTGAAGGT 2940 
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GATGCAACAT AC3GAAAACT TACCC7TAAA TTTATTTGCA CTACTGGGAA GCTACCTGTT 3000 

CCATGGCCAA CGCTTGTCAC i TACTCTCTCT TATGGTGTTC AATGCTTTTC TAGATACCCA 3060 

GATCATATGA AACAGCATGA CTTTTTCAAG AGTGCCATGC CCGAAGGTTA TCTACAGGAA 3120 

AGAACTATAT TTTACAAAGA TGACCGGAAC' TACAAGACAC G7GCTGAAGT CAAGTTTGAA . 3180 

GG7GATACCC TTGTTAATAG AA7CGAG77A AAACGTATTG ATTTTAAAGA AGATGGAAAC 3240 

ATTCTTGGAC ACAAAATGGA ATACAA77A7 AAC7CACA7A ATGTATACAT CATGGCAGAC 3 300' 

AAACCAAAGA ATCCCATCAA AG77AAC77C AAAATTAGAC ACAACATTAA ACATGGAAGC 3 3 60 

G7TCAA77AG CAGACCA77A 7CAACAAAA7 AC7CCAA77G CCCA7CGCCC 7C7CC7T77A 3420 

CCAGACAACC A77ACC7G7C CACGCAATCT GCCC777CCA AAGA7CCCAA CGAAAAGAGA 3480. 

CA7CACA7CA 7CC77C77CA C777C7AACA CC7CC7CGCA TTACACATGC CATGGATGAA 3540 

CTATACAAA7 AAAACAAC77 AACCCA7CCA C777A7CCA7 CCCTTAACTT AC77A77AAA 3600 

7AA777A7AG C7A77GAAAA GAG A7AAG AA 77G77CAAAG C7AA7A77G7 77AAA7CG7C 3650' 

AA77CC7GCA 7G7777AAGG AAT7G77AAA 77GA777777 G7AAA7A777 7C77G7A77C 3720 

T77G77AACC GA777CA7AA CGAAA7AA77 A7AC7777G7 T7ATCTTTGT G7GA7A7TC7 3780 

7GA7777777 C7AC77AA7C 7G A7AAG7C A CC7A77CAC7 77AGG777AG GA7GAAAA7A 3840 

TTCTCTTGGA ACCATACTTA A7A7AGAAA7 ■ A7GAAC77C7 GCCAT7AAAA G7AA7GCCAA 3900 

TGAGCG7777 G7A777AA7A A7C7777AGC AAACCCGTA7 7GCACGA77A AA7AAA7C7C 39 60 

ATTAGC7A7 ; A C7A7CAAAAA CAA7777GCG 7A77A7A7CC. G7AC77A7G7 .7ATAAGGTA7 . 4020 

AT7ACCA7A7 A7777A7AGG A77GG77777 AGGAAA777A AACT.GCAATA 7A7CC77G77 4080 
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TAAAACTTGG AAATTATCGT GATCAACAAG TTTATTTTCT GTAGTTTTGC ATAATTTATG 4140 

GTCTATTTCA > ATGGCAGTTA" CGAAATTACA- CCTCTTTACT AATTCAAGGG TAAAATGGCC 4200 

TTTTCCTGAG CCGATTTCAA AGATATTATC ATGTTCATTT AATCTTATAT t TTGTCATTAT • 4260 

t TTTATCTATA TTATGTTTTG AAGTAATAAA GTTTTGACTG TGTTTTATAT TTTTCTCGTT . : _ 4320 

CATTATAACC CTCTTTAATT TGGTTATATG AATTTTGCTT ATTAACGATT CATTATAACC 4380 

ACTTATTTTT TGTTTGGTTG ATAATGAACT GTGCTGATTA CAAAAATACT AAAAATGCCC • 4440 

ATATTTTTTC CTC.CTTATAA .. AATTAGTATA AiTA?A:VCAC GACCT.CTGAT; , AAATATGAAC 4 500 

ATGATGAG.TG ATCGTTAAAT. TTATACTGCA' ATCGGATGCG ATTA77G AAT AAAAGATA7G . 4 5 60 

AGAGATTTAT CTAAT TXCT7 TTTTCTTGTA AAAAu\ACAAA CTTCTTAAAG G77TTATAG7, 4 6'20 
TTTGGTCGTA GAGCACACGG TT7AACGAC7 TAATTACG'AA CTAAATAAGT CTAGTGTGTT . .,'4680 

ACACTTTATG AAA7C7ATA7: ACGTT7A7AT A7AT77A7TA 7CCGGAGG7G 7AGCATGTC7 4 7 40 ; 

CA77CAA777 TGAGGGTTGC CACACT7AAA GGATCAAGT.A '■ A.TACAAACGG> GA7ACAAAGA 4 4 800 
CATAATCAAA' GAGAGAA7.AA .AAAC7ATAA7 AA7AAAGACA 7AAATCA7GA' GG AAA CAT AT:' v ' 4860" 

AAAAA7TATG ATT7GA77AA CCCACAAAAT ATAAAGTA7A AAGATAAAAT ' TCATGAAACC" .4 9 20' 

. AT7GA7GAGA. ATTATTCAGG GAAACG7AAA A7TCGG7CAG, ATGCAAT.TCG: ACATGTCGAC- ■ 4980- 

GGAC7GGTTA CAAGTGATAA AGATITCTTT GATGATTTAA GCGGAGAAGA AATAGAACGA 5040 

TTTT77AAAG ATAGC77GGA . G777CTAGAA AATGAATACG G7AAGGAAAA .TA7GCTCTAT 5100 

GCGACTGTCC ATCTGGATGA AAGAG7CCCA CA7A7GCAC7 TTGGTTTTGT CCCTTTAACA 5160' 

GAGGACGGGA GATTGTCTGC AAAAGAACAG 77AGGCAACA AGAAAGACTT TACTCAATTA 5220 
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CAAGATAGAT TTAATGAGTA TGTGAATGAG AAAGGTTATG AACTTGAAAG AGGCACGTCC 5280 
AAAGAGGTTA CAGAACGAGA ACATAAAGCG ATG 5313 
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Claims ! 

1. A process for in vivo recombination of homologous DNA se- 
quences, comprising the following steps 
5 a) incubating a cell containing at least two DNA struc- 

tures, each comprising a DNA sequence and an origin' of 
replication, under conditions which favour integration 
" of one of the DNA-structures into one of the others, 
thereby forming a hybrid DNA structure; ;** 
10 b) incubating the cell under conditions, which favour, cross 

ing but from said hybrid DNA structure, thereby forming 
: novel DNA structures, each 'comprising a recombined DNA 
' ' sequence and an origin of replication;-, 
c) repeating- steps a)-b) at least once. 1 • 

L5 ' \ " ' 1 ; ■ ; ■ ' . \ • • • ; . ■ 

2. A process' for in vivo 'recombination of homologous". DNA se- 
quences, comprising the steps • -• 

■a).. ' incubating a cell containing at least two DNA struc- 

* t.ures,' each comprising _a ..DNA. sequence and an' origin' of 
!0 ■ replication, under conditions where i) at least one ,ori- 

' gin of replication is non-functional and ii),. the cell is 
. under a .selective pressure which only allows growth of v 
■ the cell if it .contains the DNA -structure * which is hav- 
ing the non-functional origin (i.e.' favoring/ an integra- 
5 tion of one of ■ the-- DNA-structures into one of the oth- 

ers thereby forming a' hybrid -DNA structure); , ' 

b) • changing of said conditions to conditions, where fewer 

origins ,of replication are noh- functional .than- in .step 
a) .(i.e/ favoring .a crossing . out from said hybrid DNA 
0 •*' structure); ; ' ' " "''*'■*" \ : - ;~ 

c) repeating' steps a)>b) at. least once. 

3. The process according to claim 1-2, characterized in that 
at least one DNA structure is a chromosome, preferred a bacte- 

5 rial chromosome, more preferred a Bacillus chromosome, espe- 
cially a Bacillus subtilis chromosome. 
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4". The process according to claim 1-3, characterized in that 
at least one DNA structure is a vector, preferred a bacterial 
vector, more preferred a vector able to replicate in a Bacillus 
cell, especially a Bacillus subtilis cell. 

5 

- 5. The process according' to claim 4, wherein at least two of 
the DNA constructs are two different vectors. 

6. A process for in vivo recombination of homologous DNA se- 
10 quences, comprising the steps 

a) inserting the homologous DNA sequences in at least two 
different vectors, said vectors contain different origins 
of replication; 

b) transferring the vectors into a cell; 

15 c) incubating the cell under conditions where i) at least one 
, origin of replication is non-functional and i.i) the cell* 
is under a selective pressure, which only allows growth of 
the.cell if it contains the DNA structure which is having 
the non-functional origin (i.e. favoring an integration of 
20 one of the DNA-structures into one of the others, thereby 

- forming a hybrid DNA structure)'; * 
d) changing of said conditions to conditions, where fewer " 
origins of replication are non-functional than in step c)' 
(i.e. favoring a crossing out from said hybrid DNA struc- 
2 5 t ture) ; . 

- e) - repeating -steps c) -d) as -many . tines, -as wanted, however at.~ 
least once. 

7. The process according to 'claim 1-6, characterised in that 
30 at least one origin' of replication is selected in such a way 

that the replication rate of the vector can be regulated chemi- 
cally or physically. . 

8. .The process according to claim 1 , characterised in that at 
35 least one origin of replication is temperature sensitive . 
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9.- The process according to claim 8, characterised in that the 

• replication rate of at least one vector can be regulated by a 
temperature shift, 

5 10. The process according to claim 1-9, characterised in that 

the vectors contain genes for markers, preferred selectable 
.markers, especially antibiotic markers. 

i.l. The process according to claim 10, characterised in that the 
10 vectors contain genes for different markers . 

,12. The process according to any. of the preceding claims, 
wherein at least one DNA structure do not comprise ori(-)' (i.e. 
only comprise Ori'{ + ) ) •/ 

• 13. The process according to claim 12, wherein the DNA . structure 
.which do not comprise Ori(-) is a DNA structure which origin 

\ (i\e. herein Ori(+)) is functional during . the .hybrid" DNA struc- 
ture formation performed according a process according to any of 
20' the preceding claims. 

• i 4 . The ' process according to any of the preceding claims, 
wherein It is formation of the hybrid structure is measured in- . 
vivo, preferably " by, having a system of DNA structures, wherein 
25 an in vivo screenable protein is "expressed only after the- hybrid 
structure has been forced. ' • 

15. The process according to. claim 14, wherein, said -screenable 
protein is, a. protein which is fluorescent under suitable expo- 

30 sure, and' in particular said fluorescent protein is Green Flo-^ r>i 
rescent protein (GFP) or variants thereof. 

16. The process according to any of the preceding claims, 
wherein crossing out from the hybrid DNA structure is a forced 

35 crossing out event. 

17. The process according to claim 16, wherein the forced cross- 
ing out from the hybrid structure is performed by: 
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i) constructing a system wherein the DNA hybrid struc- 
ture dp not .express a. selectable marker, and at 
least one of the at least two DNA , structures accord- 
ing to any of the preceding claims is able to ex- 
5 press said selectable marker; 

, ii) incubating the cell under selective conditions, 

thereby forcing a cross out of the DNA hybrid struc- 
ture, since the cell is only able to express the 
marker gene after the crossing out of the DNA hybrid 
10 structure has taken place. 

18. The process according to 17, wherein the selectable marker 
in step i) is expressed on a DNA; structure where the origin of- 
replication is non-functional during a forced integration of the 

is DNA structures into a DNA hybrid structure performed as de- 
scribed above, and preferably the selectable marker gene is anl 
antibiotic resistance gene! , $ 

19. The process according to any of the preceding claims, W 
20 wherein the homologous DNA sequences comprise DNA sequences from 

at least one pre-made DNA library. 

. . • ' . ■ • ■ ' . f " 

20. The process according to any of the preceding claims, 
vherein the homologous DNA sequences are comprised of two or ' :; 

25 more different set of homologous DNA sequences ("set A" "set B", 
"setC" etc.*)., and wherein each of said set of homologous se- 
quences ("set A" "set B" , "setC" etc.) are inserte'd in a differ- 
ent DNA construct ( "DNA -construct A", "DNA construct B" , "DNA 

. construct, C"' etc. . ) according to any of. the preceding claims. 

30 

21. The process according to claim 20, wherein at least one of 
said' sets of homologous DNA sequences is derived from a pre-made 
DNA- library, and at least one of said sets of homologous DNA se- 
quences is a DNA sequence which' are homologous to said DNA li- 

35 brary (i.e. resulting in a process for in vivo recombination of 
said DNA library to a DNA sequence which are homologous to said 
DNA library) . 
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22 . The process according to 20, wherein said sets of homologous 
• DNA sequences is derived: from the same pre-made DNA library 

(i.e. resulting in a process for in vivo/ recombination of said 
DNA library) . ; . t ^ ' ( ■ ; - 

s ' /. • . ". 1 ' • '-; • y '. 1 t . ./ : ,-./- ;; , . l '■ ... ..• • ■ . 

23 . The process according to claim 20, wherein said sets of . Ho- 
mologous DNA .sequences are derived' from at least "two different- 
pre-ihade DNA • libraries (i.e. resulting in a process for in vivo 
recombination of at least two different DNA libraries) . • 

24.. The process according to claim 20, wherein said sets of ho- 
mologous DNA sequences each only • contain one DNA sequence (i.e. 
recombination- of .two or more homologous sequences) .-., 

25'. The process' according to any of the preceding claims , 
. wherein two sets of homologous:; DNA * sequences., ( "set ,A" and: "set 
. B" ) are recombined, in vivo,, 'comprising, the steps . '• . ... 

a) ;. inserting sequencer's) . from "set A" in a. vector -contain-- 

• ing a temperature resistant origin, of replication/ and -a;> 
3 v - gene." for a first - marker " • '; ' • c . ■ 

b ) ' inserting . sequence(s).: from "set B" in a vector.' contain- 

.: '' ing a . temperature; sensitive origin '.of replication, and- a * 
,: gene for' a second, marker; • . • ' : ' ' ; t ] : \ 

c) ' ' -\' transferring the vectors' into a cell; • . , • . 

5' d) incubating, -the -cell at a temperature where .both origins . 

'• ' of replication, are functional, optionally under -a selec- 
;•' - ■/ tion pressure! that favours cells with 'both marker' genes; 
■ . e) .. tJ shifting the temperature: to a temperature-, where/ only ; \ 

■ . 1 . . the temperature ' resistant origin of replication is func- 
o - tional ; ■ .., ( . - . . 

f ) ■ 'repeating steps d)-e) at least once. 

26.. The process according to any of the preceding claims/, 
wherein two sets of homologous 'DNA sequences ("set A">, and "set 
5 B u ) are recombined in vivo, comprising the steps 

a) inserting seguence(s) from "set A" in a vector containing a 
temperature resistant origin of replication and a gene for a 
first marker; 
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b) inserting sequence(s) from •'set' B" in a vector contain- 
ing a temperature sensitive origin of replication and a 
gene for a second marker; 

c) transferring the. vectors into a cell; 

d) incubating the cell at a temperature where only the tem- 
perature resistant origin of replication is functional, 
optionally under a selection pressure that favours cells 
with both marker genes; 

e) optionally, shifting the temperature to a temperature, 
where both origins of replication are functional ; 

f) repeating steps d)-e) at least once. 

27. The . process according to claim 1-26, characterised in that 
the cell containing the DNA structures is defective or transito- 
rily inactivated in the ' enzymatic mismatch repair system or has 
a reduced level of mismatch; repair. . 

28. The process according to claim 1-27 , characterised in "that 
the cell is a Gram positive cell/ such as a Staphylococcus 'cell, 
a Streptococcus cell, a Bacillus cell, or in particular a Bacil- 
lus subtilis cell'. - 

29. The process, according to claim" 1-28, characterised in that 
the cell is a Gram negative cell, preferred an Escherichia cell.,, 
especially an E. coli cell. 

30. The process according to claim 1-29, characterised in that 
the /homologous DNA sequences are encoding polypeptides , espe- 
cially enzymes, and in. particular enzymes such as an amylase,, 
lipase,- cutinase, cellulase, oxidase, phytase, and a protease. 

31. A method for producing one or more recombinant protein (s) 
having a desired biological' activity comprising: 

a) * performing a process for in vivo recombination of 

homologous DNA sequences according to any of the preceding 
claims, wherein at least one of the DNA structure (s) is an 
expression vector; 
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b) " expressing the. numerous different recombinant polypeptides 

encoded by the numerous different recombined DNA sequences 
from step a); and 

c) screen or select the numerous different recombinant 
proteins from step b) in a suitable screening or selection 
system for one or more recombinant polypeptides (s) having 
a desired activity. 



32 . 



^ set of at least two vectors, where at least two vectors 
io are comprising at least two individually different .origins of 
replication able to function in the same cell under permissive 
conditions, and two homologous DNA' sequences encoding polypep- ■ 
tides, said DNA sequences having more than 50% homology, prefer- 
able moire than 7 0%. 
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33. A vector system, comprising at least . - 

a) a vector A containing a DNA sequence encoding a polypep- 
tide and an, origin of replication 

b ) a: vector b' containing a DNA sequence homologous to the 
, .DNA sequence in vector .A, ; and an. origin of replication 

that differs from the origin of replication in- vector A, 



■ 34. The- vector system according .to claim 33, characterised in 
that the DNA sequences encodes polypeptides with a biological 
25 activity, especially enzymes:.' ,, 
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11 containing a vector system according to claim 33-34 



36-. The cell according to claim 35, characterised iV that is a 
30 Gran positive cell, preferred a Bacillus cell /' especially -a Ba- 
cillus subtilis cell. 

37. The cell according to claim 35, characterised in that the 
cell. is a Gram negative cell, preferred an Escherichia cell, es- 

35 pecially an E. coli cell. 

38. A recombined DNA sequence, characterised in that it is a 
result of a process according to one or more of the claims 1-30, 
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or it is encoding a recombinant protein having a desired bio- 
logical activity, wherein said recombinant protein having a de 
.sired biological* is produced by the method according to claim 
, 31. 

5 ' ' . ' - ' ... ; 

39. A process, for producing a polypeptide, especially an en- 
zyme, comprising i) growing a cell expressing a- polypeptide en- 
coded by a recombined DNA sequence according to claim 38, and 
ii) purifying said expressed, polypeptide. ■ 

40. A polypeptide, especially an. enzyme, characterised in that 
it is produced by a process' according .to claim 39. 

'41. A polypeptide, especially an enzyme, characterised in that 
15 it is encoded by the recombined- DNA sequence according, to clain 

■ 38 . '. • . • ,. • .' . 
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